Self-Learning Multi-Objective Service Coordination Using Deep Reinforcement Learning
نویسندگان
چکیده
Modern services consist of interconnected components, e.g., microservices in a service mesh or machine learning functions pipeline. These can scale and run across multiple network nodes on demand. To process incoming traffic, components have to be instantiated traffic assigned these instances, taking capacities, changing demands, Quality Service (QoS) requirements into account. This challenge is usually solved with custom approaches designed by experts. While this typically works well for the considered scenario, models often rely unrealistic assumptions knowledge that not available practice (e.g., priori knowledge). We propose DeepCoord, novel deep reinforcement approach learns how best coordinate geared towards realistic assumptions. It interacts relies available, possibly delayed monitoring information. Rather than defining complex model an algorithm achieve objective, our model-free adapts various objectives patterns. An agent trained offline without expert then applied online minimal overhead. Compared state-of-the-art heuristic, DeepCoord significantly improves flow throughput (up 76%) overall utility (more 2x) real-world topologies traces. also supports optimizing multiple, competing objectives, respect QoS requirements, generalizes scenarios unseen, stochastic scales large networks. For reproducibility reuse, code publicly available.
منابع مشابه
Multi-Objective Service Composition Using Reinforcement Learning
Web services have the potential to offer the enterprises with the ability to compose internal and external business services in order to accomplish complex processes. Service composition then becomes an increasingly challenging issue when complex and critical applications are built upon services with different QoS criteria. However, most of the existing QoS-aware compositions are simply based o...
متن کاملMulti-Objective Deep Reinforcement Learning
We propose Deep Optimistic Linear Support Learning (DOL) to solve highdimensional multi-objective decision problems where the relative importances of the objectives are not known a priori. Using features from the high-dimensional inputs, DOL computes the convex coverage set containing all potential optimal solutions of the convex combinations of the objectives. To our knowledge, this is the fir...
متن کاملA Multi-Objective Deep Reinforcement Learning Framework
This paper presents a new multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We propose linear and non-linear methods to develop the MODRL framework that includes both single-policy and multi-policy strategies. The experimental results on a deep sea treasure environment indicate that the proposed approach is able to converge to the optimal Pareto solutions. ...
متن کاملMulti-Objective Reinforcement Learning
In multi-objective reinforcement learning (MORL) the agent is provided with multiple feedback signals when performing an action. These signals can be independent, complementary or conflicting. Hence, MORL is the process of learning policies that optimize multiple criteria simultaneously. In this abstract, we briefly describe our extensions to single-objective multi-armed bandits and reinforceme...
متن کاملWhy Multi-objective Reinforcement Learning?
We argue that multi-objective methods are underrepresented in RL research, and present three scenarios to justify the need for explicitly multi-objective approaches. Key to these scenarios is that although the utility the user derives from a policy — which is what we ultimately aim to optimize — is scalar, it is sometimes impossible, undesirable or infeasible to formulate the problem as single-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Network and Service Management
سال: 2021
ISSN: ['2373-7379', '1932-4537']
DOI: https://doi.org/10.1109/tnsm.2021.3076503